Search CORE

8 research outputs found

Simulation of integrate-and-fire neuron circuits using HfO₂-based ferroelectric field effect transistors

Author: Bertele Martin
Breyer Evelyn T.
Chicca Elisabetta
Klein Philipp
Mikolajick Thomas
Mulaosmanovic Halid
Slesazeck Stefan
Suresh Bharathwaj
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 03/01/2022
Field of study

Inspired by neurobiological systems, Spiking Neural Networks (SNNs) are gaining an increasing interest in the field of bio-inspired machine learning. Neurons, as central processing and short-term memory units of biological neural systems, are thus at the forefront of cutting-edge research approaches. The realization of CMOS circuits replicating neuronal features, namely the integration of action potentials and firing according to the all-or-nothing law, imposes various challenges like large area and power consumption. The non-volatile storage of polarization states and accumulative switching behavior of nanoscale HfO₂ - based Ferroelectric Field-Effect Transistors (FeFETs), promise to circumvent these issues. In this paper, we propose two FeFET-based neuronal circuits emulating the Integrate-and-Fire (I&F) behavior of biological neurons on the basis of SPICE simulations. Additionally, modulating the depolarization of the FeFETs enables the replication of a biology-based concept known as membrane leakage. The presented capacitor-free implementation is crucial for the development of neuromorphic systems that allow more complex features at a given area and power constraint

Qucosa

HSSS - Hochschulschriftenserver der SLUB

Technische Universität Dresden: Qucosa

ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis

Author: Alser Mohammed
Cali Damla Senol
Cavlak Meryem Banu
Firtina Can
Kalsi Gurpreet S.
Kim Jeremie
Lindegger Joel
Luna Juan Gómez
Mutlu Onur
Pillai Kamlesh
Shahroodi Taha
Subramoney Sreenivas
Suresh Bharathwaj
Publication venue
Publication date: 21/10/2023
Field of study

Profile hidden Markov models (pHMMs) are widely employed in various bioinformatics applications to identify similarities between biological sequences, such as DNA or protein sequences. In pHMMs, sequences are represented as graph structures. These probabilities are subsequently used to compute the similarity score between a sequence and a pHMM graph. The Baum-Welch algorithm, a prevalent and highly accurate method, utilizes these probabilities to optimize and compute similarity scores. However, the Baum-Welch algorithm is computationally intensive, and existing solutions offer either software-only or hardware-only approaches with fixed pHMM designs. We identify an urgent need for a flexible, high-performance, and energy-efficient HW/SW co-design to address the major inefficiencies in the Baum-Welch algorithm for pHMMs. We introduce ApHMM, the first flexible acceleration framework designed to significantly reduce both computational and energy overheads associated with the Baum-Welch algorithm for pHMMs. ApHMM tackles the major inefficiencies in the Baum-Welch algorithm by 1) designing flexible hardware to accommodate various pHMM designs, 2) exploiting predictable data dependency patterns through on-chip memory with memoization techniques, 3) rapidly filtering out negligible computations using a hardware-based filter, and 4) minimizing redundant computations. ApHMM achieves substantial speedups of 15.55x - 260.03x, 1.83x - 5.34x, and 27.97x when compared to CPU, GPU, and FPGA implementations of the Baum-Welch algorithm, respectively. ApHMM outperforms state-of-the-art CPU implementations in three key bioinformatics applications: 1) error correction, 2) protein family search, and 3) multiple sequence alignment, by 1.29x - 59.94x, 1.03x - 1.75x, and 1.03x - 1.95x, respectively, while improving their energy efficiency by 64.24x - 115.46x, 1.75x, 1.96x.Comment: Accepted to ACM TAC

arXiv.org e-Print Archive

Early Prediction of DNN Activation Using Hierarchical Computations

Author: Avishaii Abuhatzera
Bharathwaj Suresh
Gurpreet Singh Kalsi
Kamlesh Pillai
Sreenivas Subramoney
Publication venue: MDPI AG
Publication date: 01/12/2021
Field of study

Deep Neural Networks (DNNs) have set state-of-the-art performance numbers in diverse fields of electronics (computer vision, voice recognition), biology, bioinformatics, etc. However, the process of learning (training) from the data and application of the learnt information (inference) process requires huge computational resources. Approximate computing is a common method to reduce computation cost, but it introduces loss in task accuracy, which limits their application. Using an inherent property of Rectified Linear Unit (ReLU), a popular activation function, we propose a mathematical model to perform MAC operation using reduced precision for predicting negative values early. We also propose a method to perform hierarchical computation to achieve the same results as IEEE754 full precision compute. Applying this method on ResNet50 and VGG16 shows that up to 80% of ReLU zeros (which is 50% of all ReLU outputs) can be predicted and detected early by using just 3 out of 23 mantissa bits. This method is equally applicable to other floating-point representations

Directory of Open Access Journals

Early Prediction of DNN Activation Using Hierarchical Computations

Author: Avishaii Abuhatzera
Bharathwaj Suresh
Gurpreet Singh Kalsi
Kamlesh Pillai
Sreenivas Subramoney
Publication venue: 'MDPI AG'
Publication date: 04/12/2021
Field of study

Multidisciplinary Digital Publishing Institute

Simulation of integrate-and-fire neuron circuits using HfO2-based ferroelectric field effect transistors

Author: Bertele Martin
Breyer Evelyn T.
Chicca Elisabetta
Klein Philipp
Mikolajick Thomas
Mulaosmanovic Halid
Slesazeck Stefan
Suresh Bharathwaj
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Suresh B, Bertele M, Breyer ET, et al. Simulation of integrate-and-fire neuron circuits using HfO2-based ferroelectric field effect transistors. 2019 26th IEEE International Conference on Electronics, Circuits and Systems (ICECS). 2019

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Publications at Bielefeld University

Dissertations of the University of Groningen

Efficient Resistive Switching and Spike Rate Dependent Plasticity in a New CuCrO2 Memristor for Plausible Neuromorphic Systems

Author: Biswas Pranab
Boppidi Pavan Kumar Reddy
Kundu Souvik
Mullarkey Daragh
Raj P. Michael Preetam
Shvets Igor V.
Suresh Bharathwaj
Zhussupbekova Ainur
Publication venue
Publication date: 01/01/2020
Field of study

In this work, we introduce a new class of p-type transparent conductive oxide (TCO) CuCrO2 (150 nm) heterogeneously integrated onto FTO/glass for forming free memristor based neuromorphic applications. The fabricated Al/CuCrO2/FTO memristors demonstrate a reliable bipolar resistive switching with an ON/OFF ratio of 1000. The retention of the device was found to be steady even beyond 106 s, which demonstrates its non-volatility. The current-voltage (I-V) characteristics were fitted to evaluate its transport properties and a band-diagram was projected to have a better insight of the device operational principles. To validate the experimental observations, a new model has been developed, and the simulated I-V behavior was analogues to the experimental one. Efforts were then devoted to observe long-term potentiation (LTP) and longterm depression (LTD) utilizing identical but opposite pulses to evaluate the device?s efficacy for synaptic applications. The synaptic behavior was well controlled by the pulse (pulse amplitude and width) variations. The conductance change was found to be symmetric and then saturated, which reflects the popular biological Hebbian rules. Finally, a long-term synaptic modulation has been implemented by establishing the spike rate dependent plasticity (SRDP) rule, which is a part of spiking neural networks and advantageous to mimic the brain?s capability at low power. All the obtained experimental results were systematically corroborated by neural network simulation. Overall, our approach provides a new road map towards the development of TCO based alternative memristors, which can be employed to mimic the synaptic plasticity for energy-efficient bioinspired neuromorphic systems and non-Von Neumann computer architectures

Irish Universities

Efficient Resistive Switching and Spike Rate Dependent Plasticity in a New CuCrO 2

Author: Ainur Zhussupbekova
Bharathwaj Suresh
Daragh Mullarkey
Igor V. Shvets
P. Michael Preetam Raj
Pavan Kumar Reddy Boppidi
Pranab Biswas
Souvik Kundu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis

Author: Alser Mohammed
Cavlak Meryem Banu
Firtina Can
Gómez Luna Juan
Kalsi Gurpreet S.
Kim Jeremie S.
Lindegger Joël
Mutlu Onur
Pillai Kamlesh
Senol Cali Damla
Shahroodi Taha
Subramoney Sreenivas
Suresh Bharathwaj
Publication venue: Cornell University
Publication date: 20/07/2022
Field of study

Profile hidden Markov models (pHMMs) are widely used in many bioinformatics applications to accurately identify similarities between biological sequences (e.g., DNA or protein sequences). PHMMs use a commonly-adopted and highly-accurate method, called the Baum-Welch algorithm, to calculate these similarities. However, the Baum-Welch algorithm is computationally expensive, and existing works provide either software- or hardware-only solutions for a fixed pHMM design. When we analyze the state-of-the-art works, we find that there is a pressing need for a flexible, high-performant, and energy-efficient hardware-software co-design to efficiently and effectively solve all the major inefficiencies in the Baum-Welch algorithm for pHMMs. We propose ApHMM, the first flexible acceleration framework that can significantly reduce computational and energy overheads of the Baum-Welch algorithm for pHMMs. ApHMM leverages hardware-software co-design to solve the major inefficiencies in the Baum-Welch algorithm by 1) designing a flexible hardware to support different pHMMs designs, 2) exploiting the predictable data dependency pattern in an on-chip memory with memoization techniques, 3) quickly eliminating negligible computations with a hardware-based filter, and 4) minimizing the redundant computations. We implement our 1) hardware-software optimizations on a specialized hardware and 2) software optimizations for GPUs to provide the first flexible Baum-Welch accelerator for pHMMs. ApHMM provides significant speedups of 15.55x-260.03x, 1.83x-5.34x, and 27.97x compared to CPU, GPU, and FPGA implementations of the Baum-Welch algorithm, respectively. ApHMM outperforms the state-of-the-art CPU implementations of three important bioinformatics applications, 1) error correction, 2) protein family search, and 3) multiple sequence alignment, by 1.29x-59.94x, 1.03x-1.75x, and 1.03x-1.95x, respectively

Repository for Publications and Research Data